Scene Graph Generation


Scene graph generation is the task of producing a structured representation of a scene, typically a graph whose nodes are the objects in the scene and whose edges are the relationships between them.
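To make the definition concrete, here is a minimal sketch of a scene graph stored as subject-predicate-object triples. The class names (`Relation`, `SceneGraph`), the method names, and the example objects are hypothetical and chosen only for illustration; they are not taken from any of the papers listed here.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Relation:
    """One directed edge: subject --predicate--> obj (hypothetical type)."""
    subject: str
    predicate: str
    obj: str


@dataclass
class SceneGraph:
    """Objects are nodes; relations are labeled, directed edges."""
    objects: set = field(default_factory=set)
    relations: list = field(default_factory=list)

    def add_relation(self, subject: str, predicate: str, obj: str) -> None:
        # Register both endpoints as nodes, then record the edge.
        self.objects.update((subject, obj))
        self.relations.append(Relation(subject, predicate, obj))

    def relations_of(self, name: str) -> list:
        # All edges whose subject is the given object.
        return [r for r in self.relations if r.subject == name]


# Illustrative scene: a person holding a cup that sits on a table.
graph = SceneGraph()
graph.add_relation("cup", "on", "table")
graph.add_relation("person", "holding", "cup")
```

A generation model would fill such a structure from an image or video; here the triples are entered by hand purely to show the shape of the output.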

PoSh: Using Scene Graphs To Guide LLMs-as-a-Judge For Detailed Image Descriptions

Oct 21, 2025
Via arXiv

Lightweight Structured Multimodal Reasoning for Clinical Scene Understanding in Robotics

Sep 26, 2025
Via arXiv

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

Sep 26, 2025
Via arXiv

UML-CoT: Structured Reasoning and Planning with Unified Modeling Language for Robotic Room Cleaning

Sep 26, 2025
Via arXiv

Causal Reasoning Elicits Controllable 3D Scene Generation

Sep 18, 2025
Via arXiv

Measuring Epistemic Humility in Multimodal Large Language Models

Sep 11, 2025
Via arXiv

Graph-Fused Vision-Language-Action for Policy Reasoning in Multi-Arm Robotic Manipulation

Sep 09, 2025
Via arXiv

SATURN: Autoregressive Image Generation Guided by Scene Graphs

Aug 20, 2025
Via arXiv

Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?

Sep 03, 2025
Via arXiv

TRKT: Weakly Supervised Dynamic Scene Graph Generation with Temporal-enhanced Relation-aware Knowledge Transferring

Aug 07, 2025
Via arXiv